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O ^ Abstract 



Polynomial time preprocessing to reduce instance size is one of the most commonly deployed 
heuristics to tackle computationally hard problems. In a parameterized problem, every 
instance / comes with a positive integer k. The problem is said to admit a polynomial kernel 
if, in polynomial time, we can reduce the size of the instance / to a polynomial in fc, while 
preserving the answer. In this paper, we show that all problems expressible in Counting 
Monadic Second Order Logic and satisfying a compactness property admit a polynomial 
kernel on graphs of bounded genus. Our second result is that all problems that have finite 
integer index and satisfy a weaker compactness condition admit a linear kernel on graphs of 
bounded genus. The study of kernels on planar graphs was initiated by a seminal paper of 
J> ■ Alber, Fellows, and Niedermeier [J. ACM, 2004} wn0 showed that Planar Dominating 

Set admits a linear kernel. Following this result, a multitude of problems have been shown 
to admit linear kernels on planar graphs by combining the ideas of Alber et al. with problem 
specific reduction rules. Our theorems unify and extend all previously known kernelization 
^-j^ , results for planar graph problems. Combining our theorems with the Erdos-Posa property we 

obtain various new results on linear kernels for a number of packing and covering problems. 

On ' 
O 

1 Introduction 

• rH 

Preprocessing (data reduction or kernelization) as a strategy of coping with hard problems is 
universally used in almost every implementation. The history of preprocessing, like applying 
reduction rules to simplify truth functions, can be traced back to the 1950's |49j . A natural 
question in this regard is how to measure the quality of preprocessing rules proposed for a 
specific problem. For a long time the mathematical analysis of polynomial time preprocessing 
algorithms was neglected. The basic reason for this anomaly was that if we start with an instance 
/ of an NP-hard problem and can show that in polynomial time we can replace this with an 
equivalent instance /' with \I'\ < \I\ then that would imply P=NP in classical complexity. The 
situation changed drastically with advent of parameterized complexity. Combining tools from 
parameterized complexity and classical complexity it has become possible to derive upper and 
lower bounds on the sizes of reduced instances, or so called kernels. In fact some times we can 
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solve a computationally hard problem on some special inputs by just using data reduction. In 
this regard, we can trace back to the work of Arnborg et al. [SJ, where they show that each 
problem in monadic second order logic on graphs of bounded treewidth can be solved in linear 
time, by using only reductions, that is, no tree decomposition is built. There are similar ideas 
even in the earlier papers by Fellows and Langston, see for an example [30 } 1311 [32] . 

In parameterized complexity each problem instance comes with a parameter k and the pa- 
rameterized problem is said to admit a polynomial kernel if there is a polynomial time algorithm 
(the degree of polynomial is independent of k), called a kernelization algorithm, that reduces the 
input instance down to an instance with size bounded by a polynomial p{k) in k, while preserv- 
ing the answer. This reduced instance is called a p(k) kernel for the problem. If p(k) = 0(k), 
then we call it a linear kernel. Kernelization has been extensively studied in the realm of param- 
eterized complexity, resulting in polynomial kernels for a variety of problems. Notable examples 
include a 2fc-sized vertex kernel for Vertex Cover [17], a 355k kernel for Dominating Set 
on planar graphs [3], which later was improved to a 67k kernel p. 5], and an 0(k 2 ) kernel for 
Feedback Vertex Set [52] parameterized by the solution size. 

One of the most important results in the area of kernelization is given by Alber et al. [3]. 
They gave the first linear sized kernel for the Dominating Set problem on planar graphs. 
The work of Alber et al. [3j triggered an explosion of papers on kernelization, and in particular 
on kernelization of problems on planar graphs. Combining the ideas of Alber et al. [3] with 
problem specific data reduction rules, kernels of linear sizes were obtained for a variety of 
parameterized problems on planar graphs including Connected Vertex Cover, Minimum 
Edge Dominating Set, Maximum Triangle Packing, Efficient Edge Dominating 
Set, Induced Matching, Full-Degree Spanning Tree, Feedback Vertex Set, Cycle 
Packing, and Connected Dominating Set [HI HH [12l [16l [38l EHl S2 SH [47]. Dominating 
Set has received special attention from kernelization view point, leading to a linear kernel on 
graphs of bounded genus [34J and polynomial kernel on graphs excluding a fixed graph H as a 
minor and on (i-degenerated graphs [2J HS] . We refer to the survey of Guo and Niedermeier [37] 
for a detailed treatment of the area of kernelization. 

Most of the papers on linear kernels on planar graphs have the following idea in common: 
find an appropriate region decomposition of the input planar graph based on the problem in 
question, and then perform problem specific rules to reduce the part of the graph inside each 
region. The first step towards the general abstraction of all these algorithms was initiated by 
Guo and Niedermeier [38] . who introduced the notion of "problems with distance property". 
However, to prove that some problem posses a linear kernel on planar graphs, the approach of 
Guo and Niedermeier still requires constructing reduction rules which are problem dependent. 
Thus all previous work on kernelization was strongly based on the design of problem specific 
reduction rules. In this paper we step aside and find properties of problems, like expressibility 
in certain logic, which allows these reduction rules to be automated. 

Our results can be seen as what Grohe and Kreutzer call algorithmic meta theorems \36\ 
144] . Meta theorems expose the deep relations between logic and combinatorial structures, 
which is a fundamental issue of computational complexity. Also such theorems yield a better 
understanding of the scope of general algorithmic techniques and the limits of tractability. The 
very typical example is the celebrated Courcelle's theorem [20J which states that all graph 
properties definable in Monadic Second Order Logic (MSO) can be decided in linear time on 
graphs of bounded treewidth. More recent examples of such meta theorems state that all 
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first-order definable properties of planar graphs can be decided in linear time [35] and that all 
first-order definable optimization problems on classes of graphs with excluded minors can be 
approximated in polynomial time to any given approximation ratio [23J. Our meta theorems 
not only give a uniform and natural explanation for a large family of known kernelization results 
but also provide a variety of new results. In what follows we build up towards our theorems. 
We first give necessary definitions needed to formulate our results. 

Let Q g be the family of all graphs that can be embedded into a surface £ of Euler-genus at 
most g. Given a graph G embedded on a surface £ of Euler-genus g, and a set S, we define 
R£.(S) to be the set of all vertices of G whose radial distance from some vertex of S is at 
most r. The radial distance between two vertices x, y is the minimum length of an alternating 
sequence of vertices and faces starting from x and ending in y, such that every two consecutive 
elements of this sequence are incident to each other. We say that a parameterized problem 
n C Q g x N is compact if there exist an integer r such that for all (G = (V,E), k) E n, there 
is an embedding of G into a surface £ of Euler-genus at most g and a set S C V such that 
| SI < r ■ k, ~Rq(S) = V and k < |V| r . Similarly, II is quasi- compact if there exists an integer r 
such that for every (G, k) G II, there is an embedding of G into a surface £ of Euler-genus at 
most g and a set S C V such that |S| < r ■ k, tw(G \ R£.(S)) < r and k < \V\ r (by tw(G) we 
denote the treewidth of G). Notice that if a problem is compact then it is also quasi-compact. 

We use Counting Monadic Second Order Logic (CMSO) [6j[19l[ZI], an extension of MSO, as 
a basic tool to express properties of vertex/edge sets in graphs. In fact, it is known that every set 
T of graphs of bounded treewidth is CMSO-definable if and only if T is finite state |45j . Our first 
result concerns a parameterized analogue of graph optimization problems where the objective is 
to find a maximum or minimum sized vertex or edge set satisfying a CMSO-expressible property. 
In particular, the problems considered are defined as follows. In a p-MiN-CMSO graph problem 
II Q Q g x N, we are given a graph G = (V, E) and an integer k as input. The objective is to decide 
whether there is a vertex/edge set S of size at most k such that the CMSO-expressible predicate 
Pr(G, S) is satisfied. In ap-EQ-CMSO problem the size of S is required to be exactly k and in a 
p-MAX-CMSO problem the size of S is required to be at least k. The annotated version IP of a 
P-min/eq/max-CMSO problem II is defined as follows. The input is a triple (G = (V, E),Y, k) 
where G is a graph, Y C V is a set of black vertices, and A; is a non-negative integer. In the 
annotated version of a p-min/eq-CMSO graph problem, S is additionally required to be a 
subset of Y. For the annotated version of a p-MAX-CMSO graph problem S is not required to 
be a subset of Y, but instead of |S| > k we demand that |S Pi Y\ > k. 

Our results. For a parameterized problem II C Q g x N, let II C Q g x N denote the set of all 
no instances of IT. Our first result is the following theorem. 

Theorem 1. Let II C Q g x N be a p-min/eq/max-CMSO problem and either U or LI is 
compact. Then the annotated version H a admits a cubic kernel if H is ap-EQ-CMSO problem 
and a quadratic kernel ifH is a j>-min/max-CMSO problem. 

We remark that a polynomial kernel for an annotated graph problem LP, is a polynomial 
time algorithm that given an input (G = (V,E),Y,k) of IP, computes an equivalent instance 
(G' = (V, E'), Y', k') of IP such that \ V'\ and k' < k°^ . Theorem[[]has the following corollary. 

Corollary 1. Let II C Q g x N be an HP-complete p-min/eq/max-CMSO problem such that 
either II or IT is compact and IP is in NP . Then LI admits a polynomial kernel. 
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Theorem [T] and its corollary give polynomial kernels for all graph problems on surfaces for 
which such kernels are known. However, many problems in the literature are known to admit 
linear kernels on planar graphs. Our next theorem unifies and generalizes all known linear 
kernels for graph problems on surfaces. To this end we utilize the notion of finite integer index. 
This term first appeared in the work by Bodlaender and van Antwerpen-de Fluiter [13} 124) and 
is similar to the notion of finite state [H 1144 fTD] (see Subsection 12.61 for the formal definitions). 

Theorem 2. Let LT C Q g x N has finite integer index and either II or II is quasi- compact. Then 
II admits a linear kernel. 

Our two theorems are similar in spirit, yet they have a few differences. In particular, 
not every ^-min/eq/max-CMSO graph problem has finite integer index. For an example the 
Independent Dominating Set problem is ap-MiN-CMSO problem, but it does not have finite 
integer index. Also, the class of problems that have finite integer index does not have a syntactic 
characterization and hence it takes some more work to apply Theorem [2] than Theorem [TJ On 
the other hand, Theorem [2] yields linear kernels, applies to quasi-compact problems and unifies 
and generalizes results presented in [21 131 UH 021 051 13H I3SI I3S1 1321 BSl BZ] as a corollary. 

To demonstrate the applicability of our results we show in Section [6] how our theorems lead 
to polynomial or linear kernels for a variety of problems. For ease of reference we provide in the 
Appendix [Aj, a compendium of parameterized problems for which polynomial or linear kernels 
are derived. 

2 Preliminaries 

In this section we give various definitions which we make use of in the paper. Let G = (V, E) 
be a graph. A graph G' = (V 7 , E') is a subgraph of G if V C V and E' C E. The subgraph G' 
is called an induced subgraph of G if E' = {{u, v} G E \ u, v G V'}, in this case, G' is also called 
the subgraph induced by V' and denoted with G[V']. 

2.1 Parameterized algorithms and Kernels 

A parameterized problem II is a subset of T* x N for some finite alphabet T. An instance of 
a parameterized problem consists of (x,k), where k is called the parameter. A central notion 
in parameterized complexity is fixed parameter tractability (FPT) which means, for a given 
instance (x, k), solvability in time f(k) ■ p(\x\), where / is an arbitrary function of k and p is a 
polynomial in the input size. The notion of kernelization is formally defined as follows. 

Definition 1. [Kernelization] A kernelization algorithm, or in short, a kernel for a parame- 
terized problem II C T* x N is an algorithm that given (x, k) £ T* x N outputs in time polynomial 
in \x\ + k a pair (x',k') £ T* x N such that (a) (x,k) £ II if and only if (x',k') £ II and (b) 
W\,k' < g(k), where g is some computable function. The function g is referred to as the size of 
the kernel. If g(k) = k°^ or g(k) = 0(k) then we say that II admits a polynomial kernel and 
linear kernel respectively. 

2.2 Surfaces and Distances 

In this paper we consider graphs embeddable into surfaces. Let Q g be the class of all graphs 
that can be embedded into a surface S of Euler-genus at most g. We say that a graph G 
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is S-embedded if it is accompanied with an embedding of the graph into S. We define the 
normal distance between two vertices x and y to be the shortest path distance between them. 
The radial distance between x and y is defined to be one less than the minimum length of a 
sequence starting from x and ending at y such that vertices and faces alternate in the sequence. 
Given an S-embedded graph G = (V,E) and a set S C V, we denote by R^(5) and Bq(S') 
the set of all vertices that are in radial distance at most r and normal distance at most r from 
some vertex in S respectively. Notice that for every set S C V and every r > 0, it holds that 
B'g(S') C R^" +1 (S') for any embedding of G into a surface S. An alternative way of viewing 
radial distance is to consider the radial graph, Rq: an embedded multigraph whose vertices are 
the vertices and the faces of G (each face / of G is represented by a point Vf in it). An edge 
between a vertex v and a vertex Vf is drawn if and only if v is incident to /. Thus Rq is a 
bipartite multigraph, embedded in the same surface as G. Hence, if G E Q g then Rq E Q g . Also 
the radial distance of a pair of vertices in G corresponds to the normal distance in Rq. 

Let G = (Y,E) be a graph. A tree decomposition of G is a pair (T = (Vt,Et)),X = 
{X t }tev T ) such that: U ug Vr = V,Ve = (u, v) E E, 3t E Vr : {w, C At and Vt> E V, T[{i | v E 
A t }] is connected. The width of a tree decomposition is max tg Vr \Xt\ — 1 and the treewidth of 
G = (V, E) is the minimum width over all tree decompositions of G. A tree decomposition is 
called a nice tree decomposition if the following conditions are satisfied: Every node of the tree 
T has at most two children; if a node t has two children t\ and t2, then Af = Af x = Af 2 ; and 
if a node t has one child t\, then either |Aj| = {X^] + 1 and A^ E Af or |At| = |AtJ — 1 and 
At E Atj . It is possible to transform a given tree decomposition into a nice tree decomposition 
in time 0(\V\ + \E\) [$. 

Given an edge e = («, i>) of a graph G = (V,E), by contracting an edge (u,v) we mean 
identifying the vertices u and v and removing all the loops and duplicate edges. A minor of 
a graph G is a graph H that can be obtained from a subgraph of G by contracting edges. A 
graph class C is minor closed if any minor of any graph in C is also an element of C. A minor 
closed graph class C is H -minor-free or simply H-free if H ^ C. 

2.3 Compactness and Protrusions 

Let Q g be the family of all graphs that can be embedded into a surface S of Euler-genus at 
most 5. We say that a parameterized problem II C Q g x N is compact if there exist an integer 
r such that for all {G = (V,E),k) E II, there is an embedding of G into a surface S of Euler- 
genus at most (7 and a set £ C V such that \S\ < r ■ k, H G (S) = V and k < \V\ r . Similarly, 
II is quasi-compact if there exists an integer r such that for every (G, k) E II, there is an 
embedding of G into a surface S of Euler-genus at most g and a set SCV such that \S\ < r ■ k, 
tw(G \ R£(S)) < r and K |y| r . 

Given a graph G = (V, E) and S 1 C V, we define 3q{S) as the set of vertices in S that have 
a neighbor in V \ S. For a set S C y the neighbourhood of S is Nq(S) = 8q(V \ S). When it 
is clear from the context, we omit the subscripts. We now define the notion of a protrusion. 

Definition 1. [r-protrusion] Given a graph G = (V,E), we say that a set X' C V is an 
r-protrusion of G if \N{X')\ < r and tw(G[A' U N(X')]) < r. 

For an r-protrusion A', the vertex set A = A' U A (A') is an extended r-protrusion. The set 
A is the extended protrusion of A' and X' is the protrusion of A. 
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2.4 Counting Monadic Second Order Logic 

The syntax of MSO of graphs includes the logical connectives V, A, -i, =>■, variables for 
vertices, edges, set of vertices and set of edges, the quantifiers V, 3 that can be applied to these 
variables, and the following five binary relations: (1) u £ U where u is a vertex variable and U 
is a vertex set variable; (2) d £ D where d is an edge variable and D is an edge set variable; 
(3) inc(<i, u), where d is an edge variable, u is a vertex variable, and the interpretation is that 
the edge d is incident on the vertex u; (4) adj(u, v), where u and v are vertex variables u, and 
the interpretation is that u and v are adjacent; (5) equality of variables representing vertices, 
edges, set of vertices and set of edges. In addition to the usual features of monadic second-order 
logic, if we have atomic formulas testing whether the cardinality of a set is equal to n modulo 
p, where n and p are integers such that < n < p and p > 2, then this extension of the MSO 
is called the counting monadic second-order logic. So essentially CMSO is MSO with following 
atomic formula: If U denotes a set X, then card„ jP (C7) = true if and only if \X\ is n mod p. 
We refer to [H [191 EI] for a detailed introduction on CMSO. 

2.5 i-Boundaried Graphs 

Here we define the notion of t-boundaried graphs and various operations on them. 

Definition 2. [i-Boundaried Graphs] A t-boundaried graph is a graph G = (V,E) with t 
distinguished vertices, uniquely labeled from 1 to t. The set d{G) of labeled vertices is called the 
boundary of G. The vertices in d{G) are referred to as boundary vertices or terminals. 

For a graph G = (V, E) and a vertex set S C V, we will sometimes consider the graph G[S] 
as the |3(5)|-boundaried graph with d(S) being the boundary. 

Definition 3. [Gluing by ©] Let G\ and G2 be two t-boundaried graphs. We denote by Gi@G% 
the t-boundaried graph obtained by taking the disjoint union of G\ and G2 and identifying each 
vertex of d{G\) with the vertex o/c^G^) with the same label; that is, we glue them together on 
the boundaries. In G\ © G2 there is an edge between two labeled vertices if there is an edge 
between them in G\ or in G2. 

Definition 4. [Legality] Let Q be a graph class, G\ and G% be two t-boundaried graphs, and 
Gx, G% G Q. We say that G\ ® G2 is legal with respect to Q if the unified graph G\ © G2 £ Q ■ If 
the class Q is clear from the context we do not say with respect to which graph class the operation 
is legal. 

Definition 5. [Replacement] Let G = (V, E) be a graph containing an extended r-protrusion 
X. Let X' be the restricted protrusion of X and let G\ be an r-boundaried graph. The act of 
replacing X' with G\ corresponds to changing G into G\V \ X'] © G±. Replacing G[X] with G\ 
corresponds to replacing X' with G\ . 

2.6 Finite Integer Index 

Definition 6. For a parameterized problem, IT on a graph class Q and two t-boundaried graphs 
G\ and G2, we say that G\ =n G2 if there exists a constant c such that for all t-boundaried 
graphs G3 and for all k, 

• G\ © G3 is legal if and only if G2 © G3 is legal. 
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• (Gi G 3 , k) ell if and only if (G 2 G 3 , k + c) G II. 

Definition 7. [Finite Integer Index] II has finite integer index in Q if for every t there exists 
a finite set S of t-boundaried graphs such that S C Q and for any t-boundaried graph G\ there 
exists a G2 G S such that G2 =n G\ . Such a set S is called a set of representatives for (II, t) . 

Note that for every t, the relation =n on t-boundaried graphs is an equivalence relation. A 
problem II is finite integer index, if and only if for every t, =n is of finite index, that is, has a 
finite number of equivalence classes. 

3 Reduction Rules 

In this section we give reduction rules for compact annotated p-min/eq/max-CMSO graph 
problems and quasi-compact parameterized problems having finite integer index. Our reduction 
rules have the following form: 

If there is a constant size separator such that after its removal we obtain a con- 
nected component of unbounded size and of constant treewidth, then we replace 
this component with something of constant size. 

The implementation of this rule depends on whether we are dealing with an annotated p-MiN- 
CMSO, p-EQ-CMSO or p-MAX-CMSO problems, or whether the problem in question has finite 
integer index. Our reduction rules for annotated p-min/eq/max-CMSO problems have three 
parts. In the first two parts we zero in on an area to reduce, in the last part we perform the 
reduction. In all cases, we assume that we are given a sufficiently large r-protrusion. In the 
following discussion we only treat annotated j>-min/eq/max-CMSO problems where the set S 
being searched for is a set of vertices. The case where S is a set of edges can be dealt in an 
identical manner. 

3.1 Reduction for Annotated p-MlN-CMSO Problems 

We now describe the reduction rule that we apply to annotated p-MiN-CMSO problems. The 
technique employed in this section will act as a template for how we handle the annotated p- 
eq/max-CMSO problems. Recall that in an annotated p-MlN-CMSO problem IP we are given 
a graph G = (V, E) where a subset Y of the vertices of G is colored black and an integer k. The 
objective is to find a set S C Y of size at most k such that a fixed CMSO-definable property 
Pji(G,S) holds. For our reduction rule, we are also given a sufficiently large r-protrusion X' . 
In the first step of the reduction, we show that the set Y n X' can be reduced to 0(k) vertices 
without changing whether (G, k) is a yes instance to IP or not. In the second step we show that 
the r-protrusion X' can be covered by O(k) r'-protrusions such that each r'-protrusion contains 
at most a constant number of vertices from Y. In the third and final step of the reduction rule, 
we replace the largest r'-protrusion with an equivalent, but smaller r'-boundaried graph. We 
now provide the reduction rule for annotated p-MiN-CMSO problems. 

Lemma 1. Let U a be an annotated p-MiN-CMSO problem. Let G = (V,E) be a graph, Y QV 
be the set of black vertices and k be an integer. Let X be an extended r-protrusion of G. Then 
there is an integer c, and an 0(\X\) time algorithm, that computes a set of vertices Z C X flF 
with \Z\ < ck, such that if there exists aW C Y with Pn(G, W) and \ W\ < k, then there exists 
a W C Y with P U (G, W), \W'\ < k' , and W n X C Z. 
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Proof. The algorithm starts by making a tree decomposition of G[X] of width at most r, using 
the linear time algorithm to compute treewidth by Bodlaender [SJ. Now we add d{X) to each 
bag, and add one bag containing only the vertices in d{X). The tree decomposition has width 
at most 2r as the bag size is at most r + 1 + < 2r + 1. 

Consider the following equivalence relation on subsets Q C X n Y. We say that Q ~ Q', if 
and only if for all R C V — X: 

Pn(G,QuR)^Pn(G,Q'uR). (1) 

The number of equivalence classes is bounded by a function of the treewidth |14| |2"2] of 
and thus for fixed r, can be assumed to be bounded by a constant, say c. We would like to find 
a minimum sized representative of each of the equivalence classes. We describe an algorithm 
running in time 0(|X|) to find the desired set in each equivalence class. Let us consider an 
algorithm that solves an optimization version of the problem 

min{|W| WCYA P n {G, W)} 

on graphs of bounded treewidth, using a dynamic programming approach. For an example, we 
can use the algorithm described by Borie et al. |14j . The algorithm of Borie et al. p3] computes 
for each equivalence class in Relation[T](or, possibly, a refinement of the Relation[T]) the minimum 
size of a set in the class. This is done in a dynamic programming fashion, computing each value 
given a table of these values for the children of the bag in the tree decomposition. The running 
time is linear for fixed c. It is not hard to observe that we can also compute for each equivalence 
class a minimum size set Q Q X f]Y that belongs to the class. This can be done in linear time. 
If there are more than one minimum size sets in a class, then the algorithm just picks one. 

Let Q denote that set of equivalence classes of Relation [1] and suppose that for each class 
q € Q that is non-empty, we have a minimum size representative Q q . By the above argument 
we can find a minimum size representative Q q for each class in 0(|X|) time. Now, set 

Z= |J Qg. 

q£Q,\Q g \<k 

Suppose now that there exists W CF with Pn(G, W) and \ W\ < k. Consider the equivalence 
class q that contains X(~)W. Let Q q be the selected minimum size representative of q. Consider 
the set W = (W \ X) U Q q . As Pu (G, {W \ X) U {X n W)), we have that P u {G,(W\X)uQ c ) = 
Psj(G, W). Since, Q q is a minimum size representative from q, we have that \Q q \ < \W \ X\, 
and that \ W'\ < \W\. Finally, since the number of equivalence classes in Q is a function of r and 
each representative Q q has size at most k, we have that \Z\ = 0{k). This proves the lemma. □ 

Using Lemma [H we change the set Y to (Y\ X) U Z. We now show how to exploit the fact 
that Z contains 0(k) vertices. 

Partitioning Protrusions: In the second step of the reduction rule we partition the extended 
r-protrusion X into smaller r'-protrusions. 

Lemma 2. Let G = (V, E) be a graph, Y C V be the set of black vertices of G and k be 
an integer. Furthermore, let X be an extended r-protrusion and Z = X f]Y . There is a 
0(\X\) time algorithm that finds 0(\Z\) extended r'-protrusions X\,X2, . .. ,X# such that X = 
IiUl 2 U...Ul f and for every i < I, Z n X { C d(Xi). 
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Proof. We start by making a nice tree decomposition of G[-X], and adding d(X) to all bags. 
Now, we mark a number of bags. First, we mark the root bag and each forget node where a 
vertex in Z is forgotten. As each vertex is forgotten at most once in a nice tree decomposition, 
so far we have 0(|Z|) marked bags. Now, mark each bag that is the lowest common ancestor 
of two marked bags, until we cannot marks in this fashion. Standard counting arguments for 
trees show that this operation at most doubles the number of marks. Hence, there are at most 
0(|Z|) marked bags. 

We now split X into X x , X 2 , ■ ■ ■ , Xg as follows: we take parts of the tree decomposition, with 
internally no marked bags, plus the marked bags at the border. Note that each such part has 
at most two marked bags, each of size at most 2r and thus the size of d{Xi) is at most 4r for 
every i < i. Note that by the way we constructed the sets X{, Z n Xi C d(X{) for every i. □ 

Reducing Protrusions: In the third phase of our reduction rule, we find a protrusion to 
replace, and perform the replacement. Notice that this part of the reduction works both for 
annotated p-MlN-CMSO and for annotated p-EQ-CMSO problems. 

Lemma 3. Let IP be an annotated p-MiN-CMSO or p-EQ-CMSO problem. There is a fixed 
constant c depending only on LP such that there is an algorithm that given a graph G = (V, E) G 
Q g , a set Y C V of black vertices, an integer k and an extended r-protrusion X with \X\ > c 
such that Y D X CI d(X), runs in time 0(\X\), and produces a graph G* = (V*,E*) G Q g such 
that \V*\ < \V\ and (G*,k) G IP if and only if(G,k) G IP. 

Proof. For two i-boundaried graphs G\ and G2, we say that they are equivalent with respect 
to a subset S of d{G\) = d(G 2 ) if for every G3 = (Vs,E 3 ) and set S' C V3 we have that 
Pu(Gi ®G 3 ,SUS') if and only if Pu(G 2 ®G 3 ,SUS'). If G x and G 2 are equivalent with respect 
to S we say that G\ ~s Gi- The canonical equivalence relation for CMSO properties with free 
set variables has finite index [14|. 122] and hence the number of equivalence classes of ~s depends 
only on t for every fixed S C d{G\). We make a new equivalence relation defined on the set of 
t-boundaried graphs belonging to a graph class Q. Two £-boundaried graphs Gi,G 2 G G g are 
equivalent if 

• for every t-boundaried graph G3, G\® G3 is legal if and only if G 2 © G3 is legal; and 

• for every S C d(G x ) = d(G 2 ), d ~ s G 2 . 

Now, ~s has a finite number of equivalence classes for every S C d(G\) and the class Q g is 
characterized by a finite set of forbidden minors. Hence the number of equivalence classes in 
the equivalence relation defined above is a function of t. Let S be a set of r-boundaried graphs 
containing one smallest representative for each equivalence class of the relation above. Let c the 
size of the largest graph in S. We have that X is a r-boundaried graph with boundary d(X). 
Let Gi be a graph in S such that G\ and G[X] are equivalent. 

Now we replace G[X] with G% = (V%,Ei) in the graph G, and let the resulting graph be 
G* = (V*,E*). Let Yi be the set of black vertices in d(G\). We let Y* = Y \ X U Y x be the set 
of black vertices in G* . Since \X\ > c, we have that |Vi| < |X| and hence \ V*\ < \ V\. It remains 
to prove that (G, k) G Tl a if and only if (G* , k) G n Q . In one direction, suppose there is a set 
SCY such that P n {G, S) holds. Then SfllC d(G[X}) and since G[X] and G x are equivalent 
with respect to the relation above we have that P\j(G* , S) holds. In the other direction, suppose 
Pu{G*,S) holds. Since Y* D V\ C d{G x ) and G[X] and G% are equivalent with respect to the 
relation above, we have that Pu(G,S) holds. This concludes the proof. □ 
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Lemmata[TJ[2]and[3]together yield a reduction rule for all annotated p-MiN-CMSO problems. 

Lemma 4. Let IF be an annotated p-MiN-CMSO problem. There is a fixed constant c de- 
pending only on IF such that there is an algorithm that given a graph G = (V,E) G Q g , a set 
Y C V of black vertices, an integer k and an extended r-protrusion X with \X\ > ck, runs in 
time 0{\X\), and produces a graph G* = (V*,E*) G G g such that \V*\ < \ V\ and (G*,k) G IF 
if and only if (G, k) G IF . 

Proof. The algorithm starts by applying Lemma [T] to X, thus making all but at most ak black 
vertices uncolored for some fixed constant a. By Lemma El X = X\ U X2 . . . U X^k for some 
fixed constant b, where for every i, X{ is an extended 4r-protrusion such that Y D Xi C d(Xi). 
By the pigeon-hole principle some Xi has size at least \X\jbk > c/b. Choose c such that c/b is 
sufficiently large to apply the algorithm in Lemma [3l and then apply Lemma [3] on the extended 
protrusion Xi. This concludes the proof. □ 

3.2 Reduction for Annotated p-EQ-CMSO Problems 

In this section we give a reduction rule for annotated p-EQ-CMSO problems. The rule is very 
similar to the one for the p-MiN-CMSO problems described in the previous section. Therefore 
we only highlight the differences between the two rules in our arguments. The main difference 
between the two problem variants is that we need to keep track of solutions of every fixed size 
between and k, instead of just the smallest one in each class. Because of this we require the 
protrusion to contain at least ck 2 vertices instead of ck vertices, in order to be able to reduce 
it. 

Lemma 5. Let IF be an annotated p-EQ-CMSO problem. There is a fixed constant c depending 
only on IF such that there is an algorithm that given a graph G = (V,E) G Q g , a set Y C V 
of black vertices, an integer k and an extended r-protrusion X with \X\ > ck 2 , runs in time 
0(k\X\), and produces a graph G* = (V*,E*) G Q g such that \V*\ < \V\ and (G*,k) G IF if 
and only if(G,k) G IF . 

Proof. We show that if YC\X > ak 2 for some fixed constant k, then we can remove some vertices 
from Y preserving the answer. The proof proceeds almost as the proof of Lemma [TJ The main 
difference is that now, instead of taking a minimum size representative from each equivalence 
class we consider all possible sizes for S between and k, for each equivalence class. That is, 
for each £, < t < k, and each equivalence class q, we make a set Q q / C X from the class with 
\Qq,t\ = Now, set 

Z= |J Q Q/ . 

qeQ,o<£<k,\Q qii \=£ 

In the dynamic programming algorithm, we must also have one table entry for each class and 
each size. This gives a running time of 0(A;|X|) for the first part of the reduction rule. 

Next, we remove all vertices in X \ Z from Y and apply Lemma [2j This gives us X = 
X\ U X2 ■ ■ ■ U X bk 2 for some constant 6, where for every i, Xi is an extended 4r- protrusion with 

znXi<z d(Xi). 

By the pigeon-hole principle some Xi has size at least \X\jbk 2 > c/b. Choose c such that 
c/b is sufficiently large to apply the algorithm in Lemma [3l and then we apply Lemma [3] on the 
extended protrusion Xi. This concludes the proof. □ 



10 



3.3 Reduction for Annotated p-MAX-CMSO Problems 



We now give a reduction rule for annotated p- MAX- CM SO problems. The rule is still similar 
to the ones described in the two previous sections, but differs more from the p-MiN-CMSO 
problems than p-EQ-CMSO did. 

Lemma 6. Let IP be an annotated p-MAX-CMSO problem. There is a fixed constant c de- 
pending only on U a such that there is an algorithm that given a graph G = (V,E) G Q g , a set 

Y C V of black vertices, an integer k and an extended r -protrusion X with \X\ > ck, runs in 
time 0(\X\), and produces a graph G* = (V*,E*) G G g such that \V*\ < \ V\ and (G*,k) G IF 
if and only if (G, k) G IP. 

Proof. We again begin in a manner similar to the proof of Lemma [TJ The main ingredient in 
the proof of Lemma Q] is that for a given extended r-protrusion X, we consider the equivalence 
relation ~ on subsets Q C X, where we demand that Q ~ Q' if and only if for all R C 

V — X: Pji(Q U R) 44> Pn(Q' U R)- The number of equivalence classes of ~ is bounded, but for 
maximization problems we need to keep the largest representative of each class. Hence we can 
not guarantee that the union of all the representatives we store is bounded by a function of k. 
To overcome this difficulty we use the expressive power provided by annotation. We compute 
a largest representative Q q C X for each equivalence class q of ~. Then, we build a vertex set 
Z C X n Y as follows. For each non-empty equivalence class q, if \Q q (~}Y\ < k, then we add 
Q q n Y to Z. If \Q q Pi Y\ > k, then we select arbitrarily a /c-sized subset of Q q n Y and add it 
to Z. Thus \Z\ < ak for some constant a. We remove all vertices in X \ Z from Y, without 
changing the membership of (G, k) in IP. 

Next we apply Lemma [2] on X. This gives us X = X% U X<i . . . U X^ for some constant b, 
where for every i, Xj is an extended 4r-protrusion with Z n Xi C d(Xj). 

We now describe how to modify Lemma [3] so that it can also be applied to annotated p- 
MAX-CMSO problems. For a set S C Y we define P(i(G, S) as there exists a set S' containing S 
such that Pn(G, S') holds. Clearly (G, k) £ IF if and only if there is a set 5CFof size k such 
that Pyi(G,S). If the relation ~^ in Lemma [3] is defined using P^(G,S) instead of Pn(G, S), 
the proof goes through also for annotated p-MAX-CMSO problems. 

By the pigeon-hole principle some Xi has size at least \X\/bk > c/b. Choose c such that c/b 
is sufficiently large to apply the algorithm in the modified version of Lemma O and then apply 
the modified version of Lemma [3] to p-MAX-CMSO problems on the extended protrusion Xi. 
This concludes the proof. □ 

3.4 Reductions Based on Finite Integer Index 

In the previous sections we gave reduction rules for annotated p-min/eq/max-CMSO problems. 
These reduction rules, together with the results proved later in this article will give quadratic 
or cubic kernels for the problems in question. However, for many problems we can in fact show 
that they admit a linear kernel. In this section we provide reduction rules for graph problems 
that have finite integer index. These reduction rules will yield linear kernels for the problems 
they apply to. We are now ready to prove the reduction lemma for problems that have finite 
integer index. 

Lemma 7. Let n C Q g x N has finite integer index in Q g and either n or li is quasi- compact. 
There exists a constant c and an algorithm that given a graph G = (V, E) G Q, an integer k 
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and an extended r -protrusion X in G with \X\ > c, runs in time 0{\X\) and returns a graph 
G* = (V*,E*) G Q g and an integer k* such that \V*\ < \V\, k* < k and (G*,k*) £ U if and 
only if (G, k) € II. 

Proof. Let S be a set of representatives for (II, r) and let c = maxygs \Y\. Similarly let S' be 
a set of representatives for (II, 2r) and let d = maxyes' \ Y\. If \X\ > 3c' we find an extended 
2r-protrusion X' C X such that c' < \X'\ < 3c' and work on X' instead of X. This can 
be done in time 0(|X|) since G[X] has treewidth at most r. From now on, we assume that 
|X| < 3c'. This initial step is the only step of the algorithm that does not work with constant 
size structures, and hence the running time of the algorithm is upper bounded by 0(|X|). The 
algorithm proceeds as follows. 

Because II has finite integer index there is a graph H = (Vh, Eh) G <S such that H =n G[X]. 
We show how to compute H from X. Since either II or LT is quasi-compact there exists an integer 
p such that k < \ V\ P . Let k max = (6c') p . For every G 1 = G S, G 2 = (V 2 ,E 2 ) G S and 

k' < k max we compute whether (Gj © G 2 , k 1 ) G II. For each such triple the computation can be 
done in time 0((|Vi| + |V2|) P ) since LT has finite integer index \13\ 124] . Now, for every G\ G S 
and k' < (\X\ + \ Vi\) p we compute whether (G[A] @ G\,k') G II. When all these computations 
are done, the results are stored in a table. 

It is not hard to see that H =n G[X] if and only if there exists a constant c such that 
for all G 2 G S and k! < k max , (H © G 2 ,k') G n O (G[X\ ®G 2 ,k' + c) G n. Also, c is the 
constant such that for all r-boundaried graphs G 2 and and integers k' , (H © G 2 ,k') G II <£4> 
(G[X] © G 2 , k' + c) G II. For each H G C we can check whether H =n G[X] using this condition 
and the pre-computed table, and if H =n GLY], find the constant c. 

After we have found a H G S and the corresponding constant c, such that H =n G[A], we 
make G* from G by replacing the extended r-protrusion X with H. Also, we set k* = k — c. 
Since |A| > c and H has at most c vertices, \V*\ < \V\. By the choice of H and c, (G*, k*) Eli 
if and only if (G, k) G II. This concludes the proof. □ 



4 Decomposition Theorems 

Definition 8. We say that a graph G = (V, E) is (a, (3, 7)-structured around S if\S\ < a and V 
can be partitioned into S, C\, C 2 , . . . , C 7 such that No(Ci) C S, and max{|A r c(Ci)|,tw(G[Ci U 
N G(Ci)})} < (3, for every i = 1, . . . , 7. 

Let G = (V,E) be a graph embedded in some surface S. A noose in G is a closed curve N 
of S meeting only the vertices of G, we denote these vertices by Vjy = V D 2V. We also define 
the length of 2V as the number of vertices it meets and we denote it by |2V|, that is, |2V| = \ Vn\- 
The face-width of G is the minimum length of a non-contractible noose in G. We denote the 
Euler genus of a surface E by eg(S). 

Lemma 8. Xe£ G = (V, E) be a graph in Q g and let S C V smc/i i/iaf B^(5) < Then, 
tw(G) < 4(2r + l)Vg + 2c/ + 8r + 12 5 . 

Proof. The result follows closely the argument of the proof of Theorem 3.2] that, in turn, 
is based on [251 Lemma 3.1] about the distribution of every such set S in the interior of a 
(p x p)-grid. As now we have graphs of higher genus we have to apply [27\ theorem 4.7] and 
find a lower bound on the size of S in the graph obtained by a (p x p)-grid after adding O(g) 
edges. □ 
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From now on, we set f(r, g) = 4(2r + 1)^/2 + 2g + 8r + 12g. 



Lemma 9. Let G = (V 7 , E) be a graph, embedded in a surface £ of Euler genus g such that 
either g = or the face-width of G is strictly greater than 4r + 2. Let S C.V be a set containing 
at least 3 vertices where B£,(S) = V. Then there exists a set S'cy such that S C S' and G is 
(r • (4r + 2) • (3\S\ — 6 + 6g), f(r, g),r ■ (3\S\ — 6 + 6g)) -structured around S' . 

Proof. We first need the following claim. 

Claim: Let G = (V, E) be a graph, embedded in a surface E of Euler genus g such that either 
g = or the face-width of G is more than 4r + 2. Let S be a set of at least 3 vertices such that 
Wq(S) = V. Then there is a collection TZ of closed subsets of £, also known as regions, such 
that 

• If two sets in TZ have common points, then these points lay on their boundaries. 

• UjMi*nv = v. 

• The boundary of each R G TZ is the union of two paths of length < 2r + 1 between two 
vertices of S called the anchors of R. We denote the set of vertices on the boundary of R 
by bor( J R). 

• For each R G TZ with u and v as anchors it holds that R n V C B^.({u, t> }). 

• For each R &TZ with u and u as anchors it holds that S D R = {u, v}. 

• \1Z\ < r ■ (3\S\ +6g- 6). 

The above claim follows from [38\ Lemma 1] for the case where g = 0. For the sake of complete- 
ness, we briefly present this proof together with its natural extension for embeddings of higher 
genus provided that the face-width of G is > 4r + 2. A collection 1Z of closed subsets of S is 
constructed by a greedy algorithm as follows: 

Start with an empty 1Z and, as long as there are vertices not contained in some 
region R in 1Z, hnd a area-maximal region R dehned by two paths of length < 2r + 1 
between two vertices in S (its anchors) that do not contain any vertex in S and add 
it to TZ. 

Notice that such a region R always exists even for non-planar embeddings, provided that the 
face-width of G is bigger than 4r + 2, as this region is always inside a big enough disk of the 
surface. To prove the claimed upper bound on the size of TZ, consider a multigraph G-ji (again 
embedded in E) with vertex set S and there is an edge between u,v G S whenever u and v are 
the anchors of some region R G TZ. Notice that G-ji is also embedded in E and has face-width 
bigger than 1. This in turn implies that if two vertices u,v are joined by many copies of the 
same edge in Gfi, then a pair of these edges will define a disk A in E containing in its interior all 
other copies. Moreover, we may assume that E\ A contains a vertex w of S (this is necessary in 
case G has no planar embedding). We claim that the following thinness property holds: if there 
are x > 2r copies of the edge {u,v}, then there exists a vertex w' G S laying in the interior of 
one of the x — 1 > 2r — 1 area minimal disks Aj, . . . , A x defined by the copies of {u, v} inside 
A (the order is chosen such that consecutive disks have common edges). We prove this using 
the argument of the proof of [38, Lemma 1]. Assume to the contrary and consider the disk A r . 
Indeed, by area-maximally of the choice of the regions in the above greedy procedure, it follows 
that A r contains a vertex z G V whose distance in G from u and v and every other vertex in 
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S is bigger than r, a contradiction. This implies that there exists a vertex w'gS laying in the 
interior of one of the the x — 1 > 2r — 1 area minimal disks Ax, • • • , Aj. defined by the copies 
of {u,v} inside A. Using the thinness property of G-ji (see also [U Lemma 5]) along with the 
Euler formula for graphs embedded in higher genus surfaces, we derive the claimed bound for 
1 72, | and the claim follows. 

To complete the proof of the lemma, we define S' = U_Re^ bor(-R) H V, that is, S' contains 
all the vertices belonging to the boundary of the sets in 1Z. As each such boundary is the union 
of two (u, u)-paths of length < 2r + 1 where u,v 6 S, we have that such a boundary can have at 
most 4r + 2 vertices. As \K\ < r ■ (3\S\ +65-6), we obtain that \S'\ < r ■ (4r + 2) • (3|5| - 6 + 65). 
For each Ri G 1Z assign a set Cj containing each connected component C = (Vc,Ec) of G\S' 
for which -ZVg^T'c') Q hor(R). If a connected component of G \ S' can be assigned to more 
than one i^, break ties arbitrarily so that {Ci,i = 1, . . . , \TZ\} forms a partition of the set of 
the connected components of G \ S'. Notice that for each C £ Ci, Ng{Vc) Q Ri Q S' and if 
we set Q = Uce&Vc we have that N G {d) C Ri C S' and thus \N G (d)\ < 4r + 2 < f(r,g) 
(for z = 1, . . . , \1Z\). It remains to prove that if J{ = G[C{ U Na{Ci)] then tw(Jj) < f(r, g). For 
this, recall that iJflFC B£,({u, u}) and from LemmaEl we obtain that tw(Jj) < f(r,g), (for 
j = l,...,|^|). □ 

We are now in position to prove the following. 

Lemma 10. Let G = (V, E) be a graph, embedded in a surface S of Euler genus g and let 
S Q V , \S\ > 3 such that YV G {S) = V for some r > 0. Then there exists a set S' such that 
SCS' and G is (h(r,g) ■ \S\,h(r,g),h(r,g) • \S\) -structured around S' , where h(r,g) = 0(rg). 

Proof. We use induction on g. In particular, we prove that there exists a set S' C V such that 
G is (a g 151, P g , 7 9) |s|)-structured around S' where, for g = 0, we set ao tX = r ■ (4r + 2) • (3x — 6), 
@o = /(r, 0) and 7o jX = r • (3x — 6) and for g > 1 we have a 5)X = (2p — 1) • r • 6(4r + 2) 2 + r • 
(4r + 2) • (3s + 6g - 6), p g = f(r,g) and 7 9iX . = (2g - 1) • r ■ 6(4r + 2) + r • (3x + 6g - 6). Once 
we have proved this, the lemma follows by suitably choosing the function h. For our proof we 
distinguish two cases: 

Case 1. g = or the embedding of G has face-width > 4r + 2. Then we draw G together with 
its radial graph R G = (Vr,Er) and denote it by G = G U R G = (V,E), the superposition of 
the two drawings. By the definition of the radial graph, it follows that B^ T (S I ) = V. Clearly, 
as G is a subgraph of G, the result will follow if we prove that G is (a g 151, /3 g , 7 fl i£<|)-structured 
around some set S' QVr. If G is embeddable in the sphere, then G is also embeddable in the 
sphere and if G is embeddable in a surface E of Euler genus g > Ar + 2 then the face-width of 
G is also greater than 4r + 2. Therefore, in either case we have all the conditions required to 
apply Lemma [9] and hence the claim follows by applying Lemma [9J 

Case 2. There is a non-contractible noose iV in E of length at most 4r + 2. Then we split the 
graph along the vertices Vm of the noose and we distinguish two subcases depending if iV is a 
surface separating or not. 

Subcase 2.1. If iV is a surface separating noose, then the splitting of the vertices of iV creates 
two graphs G\ = (Vi,E\) and G2 = (^2,-^2) embedded in surfaces Ei and E2 respectively 
such that if eg(Ej) = gi,i = 1,2 then g\ + g<i < g and g\ ■ g<i > 0. Let Si consists of the 
vertices in S D V% and all the (duplicated) vertices met by N. Notice that ~R r G .(Si) = V{ and 
that \Si\ + IS2I < 2|JV| + \S\ < 2(4r + 2) + \S\. 
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By the induction hypothesis, Gi is (a g . 15.1, f3 gi , 7 s .|s.i)-structured around some set S[ where 
Si Q S[ and gi > 1, i = 1, 2. Let G + be the disjoint union of G\ and G2 and observe that G + is 
(a git \ Sl \ + a 92> |5 2 |,max{/3 31 ,/3 S2 },7 9l | 5l | + 7 92i | ^-structured around S[ U S^- Notice that 



a gi,\Si\ + <Xg2,\S 2 \ < (2 5 i-l + 2 52 -l)-r-6(4r + 2) 2 + 

r • (4r + 2)(3|5i| + 3|5 2 | + 651 + 6g 2 - 12) 

< (2pi + 2g 2 - 2) • r ■ 6(4r + 2) 2 + 

r ■ (4r + 2)(3(2(4r + 2) + +65-6) 

< {2gi + 2^2 - 1) • r • 6(4r + 2) 2 - r • 6(4r + 2) 2 + 
r • 6(4r + 2) 2 + r ■ (4r + 2) • (3\S\ +65-6) 

< (25 - 1) • r • 6(4r + 2) 2 + r • (4r + 2) • (3|5| +65-6) 
= a g ,\s\- 

Similarly, we can show that 'YgiJS'il + T^,!^ — Tg,|S| an d, as max{/3 9l , /3 S2 } < /9 g , we conclude 
that G + is (a Si |5|,/3 g ,7 g i5|)-structured around S[ U 5 2 . As all the duplicated vertices of Vn are 
in Si U 5*2, we can identify back these duplicated vertices occurring in Si and S 2 and obtain 
that G is (a g igi, /3 S , 7 p J5|)-stractured around some set 5'. 

Subcase 2.2. If N is not a surface separating noose, then the splitting of the vertices of N 
creates a new graph Go = (Vo,Eq) embedded in a surface So of Euler genus go where go < g. 
Notice that if So is the set of vertices in Go consisting of the non-duplicated vertices of S and 
all the duplicated vertices, then R r Go (S ) = V and \S Q \ < 2\N\ + \S\ < 2(4r + 2) + \S\. From the 
induction hypothesis Go is (a go ,|s |,/3g >'Ygo ,|s |)-structured around some set S' where So C S' Q . 
Notice that 

a go ,\s \ < (2<7o - 1) • r • 6(4r + 2) 2 + r • (4r + 2) ■ (3\S \ + 6g -6) 

< (2(g - 1) - 1) • r ■ 6(4r + 2) 2 + r ■ (4r + 2) • (3(2(4*- + 2) + \S\) + 6g - 6) 

< (2g - 1) • r • 6(4r + 2) 2 - 2 • r • 6(4r + 2) 2 + 
r ■ 6(4r + 2) 2 + r • (4r + 2) • (3|5| +65-6) 

(25 - 1) • r ■ 6(4r + 2) 2 + r ■ (4r + 2) • (3\S\ +65-6) 

= a 9,\S\- 

and similarly 7 So is | — 7s,|S|- Ago — @gi we conclude that Go is (a g 151, /3 g , 7 a |5|)-structured 
around 5 . As all the duplicated vertices of Vjv are in So, we can identify them back and deduce 
that G is (q s |5|, 7 5j |5|)-structured around some set S'. This concludes the proof. □ 



5 Kernels 

In this section we prove Theorems [1] and [2j We say that an instance (G', k') of a parameterized 
problem II is reduced with respect to a set Q of reduction rules if none of the reduction rules in 
Q can be applied to (G',k'). 



5.1 Proof of Theorem Q] 

Proof. We first give a proof for the case when II is compact and IP is an annotated p-MIN- 
CMSO problem. A proof for the case when II is compact and II a is an annotated p-eq/max- 
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CMSO problem is identical. 

We know that II is compact and hence there exists an integer r such that for all (G = 
(V, E),k) G IP, there is an embedding of G into a surface of genus at most g, and a set S C V 
such that Hq(S) = V and \S\ < r ■ k. We show that for all (G,k) G H a , the equivalent 
instance (G 1 = (V' ,E'),k), reduced with respect to the reduction rule given by Lemma HI has 
\V'\ = 0(k 2 ). Since (G' = (V',E'),k) G Tl a and II is compact, there exists an embedding of 
G' into a surface of genus at most g and a set S C V' such that Hq, (S) = V. Hence by 
applying Lemma [10] we obtain a set S' such that G' is (a, (3, 7)-structured around S', where 
a, 7 = 0(rg\S\) and /3 = 0{rg). This implies that V' can be partitioned into S', C%, C2, ■ ■ ■ , C 7 
such that 7V G ,(Ci) c S', \N G '(Ci)\ < (3 and tw(G'[Ci U iV G (Ci)]) < for every i < 7. Observe 
that each Cj is a /3-protrusion in G' and |Cj U iVis(Ci)| < cfc, where c is a constant of Lemma [H 
otherwise we could have applied Lemma [H This implies that 



for some fixed g and r. Here constants hidden in big-Oh depend only on r and g. 

So given an input (G, k), if the size of the reduced graph is more than c*k 2 for some constant 
c* then we return NO else we have G' as the desired annotated kernel for G. 

Now we give a proof for the case when n is compact and n a is an annotated p-MAX- 
CMSO problem. A proof for the case when n is compact and n a is an annotated p-min/eq- 
CMSO problem is similar. Towards this end, we claim that for all (G,k) G n°, the equivalent 
instance (G' = (V' , E'), k), reduced with respect to the reduction rule given by Lemma El has 
\V'\ = 0(k 2 ). The proof for the claim is identical to the one we gave above to bound all the 
YES instance for an annotated compact p-MiN-CMSO problem. So given an input (G,k), if 
the equivalent instance (C = (V',E'),k), reduced with respect to the reduction rule given by 
Lemma El is more than c*k 2 for some constant c* then we return YES else we have G' as the 
desired annotated kernel for G. The reason we return YES is that if (G, k) would have a NO 
instance then the size of \V'\ < c*k 2 as (G, k) G ff*. □ 

5.2 Proof of Corollary CD 

Proof. We know that n is NP-complete and the annotated version Tl a is in NP. So given an 
instance (G = (V,E),k), we apply Theorem [1] on the annotated instance (G = (V,E),V,k), 
that is we take V as Y, the set of black vertices. If we get YES or NO as an answer then we 
return the same. Else for {G' = (V',E'),k) of size polynomial in k, we apply polynomial time 
many to one reduction from n a to n on G' and obtain a graph G" = (V", E") G Q g and an 



integer k' such that \V"\,k' < A;° (1) and (G',k) G U a if and only if (G", k') G n. This implies 



5.3 Proof of Theorem [2] 

In this section we prove Theorem [2J The idea of the proof is that if a problem has finite 
integer index then an instance reduced with respect to the reduction rule given by Lemma [7] 
has bounded radial distance. We first prove a lemma which will assist us in proving Theorem [2j 




that in this case we have polynomial kernel for n. 



□ 
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Lemma 11. Let G = (V,E) be a graph embedded into a surface of genus at most g and 5C1/ 
such that tw(G \ Rq(S)) < r and all r -protrusions of G have size at most m. Then there exists 
aS' QV such that \S'\ < \S\ + g ■ (m + 2r + 1) and V = R^ +3r+1 (S") . 

Proof. We prove the lemma using induction on Euler genus g of the graph G \ H, G (S). We 
distinguish two cases: 

Case 1. g = or the embedding of G \ YV G {S) has face-width > m + 2r + 1. We claim that V = 
R£f +3r+1 (5), that is, S' = S. For this assume, towards a contradiction, that x £ V\R G +3r+1 (S) 
and consider the subgraph J of G induced by the set ~R G +2r+1 (x), that is, the vertices of G 
that are within radial distance at most m + 2r + 1 from x. Notice that J is a subgraph of 
G\Hq(S), therefore tw(J) < r. As either g = or because the face-width of G is > m+2r + 1, 
all the vertices and edges of J are embedded inside a closed disk in E. Moreover, there exist 
m + 2r + 1 nested disjoint cycles Ci, . . . , C m +2r+i with vertex sets Vi, . . . , V m +2r+i respectively 
in Rq such that, if A, is the closed disk with C, as its border and contains x then i < j implies 
that Aj C Aj. For i = 1, . . . ,m + 2r + 1, Vi contains vertices and faces whose radial distance 
from x, in G, is exactly i. We need the following claim. 

Claim. Each cycle of the radial graph Rq that is entirely in (A m+r+ i \ A m ) and separates S 
and x, has length > 2r. 

Proof. Indeed, if this is not the case for some cycle C with vertex set Vc, then L = V n Vc is 
a separator of G where \L\ < r and such that the connected component, say F, of G \ L that 
contains x is a subgraph of J. Then tw(F) < tw(J) < r and i 7 is an r-protrusion of G. As 
F contains x and has more than m vertices, it is a contradiction to the assumption that all 
r-protrusions of G have size at most m. □ 
Applying the above claim to the cycles C m +i, . . . , C m +2r+i we obtain that they all have 
length > 2r. We now construct an auxiliary graph R* by taking Rq Pi (A m+r+ i \ A m ), adding 
a vertex s adjacent to all vertices of C m +i and adding a vertex t adjacent to all vertices in 
C m+ 2r+i- We claim that there is no (s, i)-separator in R* of size < 2r. Indeed, such a separator 
would imply the existence of a cycle C in Rq of size < 2r, a contradiction to the above claim. 
By Menger's theorem, it follows that there are > 2r disjoint (s, t)-paths in R* that correspond 
to > 2r disjoint paths from the vertices of C m+ \ to the vertices of C m+ 2 r +i- The intersection 
of these paths with cycles C m+ i, . . . , C m+ 2 r +i, imply the existence of a (2r + 1) x (2r + l)-grid 
as a minor of Rj, where Rj is the radial graph corresponding to J. Therefore, tw(i?j) > 2r 
(from [9, Lemma 88]), which, using \43\ Lemma 3], implies that tw(J) > r, a contradiction. 

Case 2. There is a non-contractible noose N, meeting the vertices of V in Vn, in G \ TV G (S) 
of length at most r' = m + 2r + 1 (assume that G \ TU G (S) is embedded in a surface X). Let 
S" = S U Vn- Observe that tw(G \ H r G (S")) < r and all r-protrusions of G have size at 
most m. Furthermore let X' be the surface in which G \ TU G (S") can be embedded. Then 
eg(S') < eg(S) < g. Hence by induction hypothesis, there exists a set S' such that 

\S'\ < \S"\ + (g - l)(m + 2r + 1) < \S\ + \V N \ + (g - l)(m + 2r + 1) < \S\ + g(m + 2r + 1), 

and V = K^ +3r+1 (S'). This concludes the proof. □ 

Proof of Theorem [2j Let us assume that II is quasi-compact. Fix t = c*rg where c* is a 
constant to be defined later. Let (G' = (V',E'), k'), k' < k, be a reduced instance with respect 
to reduction rule given by Lemma [71 Hence there is no extended t-protrusion of size more than 
c, where c is a constant appearing in the statement of Lemma [7J and (G, fc) € II if and only if 
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{G' , k') G II and G' G Q g . Hence, what remains to show is that | V^' | = 0(k). The proof for this 
is similar to the one given for Theorem [TJ 

Now we show that if (C = (V',E'),k') G II then \V'\ < 0(k). Since II is quasi-compact 
and (G',k') G II, there is an embedding of G into a surface of genus at most g and a set 
S C V' such that \S\ < r ■ k' and tw(G" \ R^, (S)) < r. Since all i-protrusions of G' are of 
size most c we have that all r-protrusions of G' are of size at most c. Hence by Lemma [TT1 
there exists a set S' such that \S'\ < \S\ + g ■ (c + 2r + 1) and V = B%, +C+1 (S'). Given 5', 
we apply Lemma [TOl and obtain a set 5" such that G' is (a, /?, 7)-structured around S", where 
a, 7 = 0(r<7|S"|) and /9 = 0(rg). This implies that 1/' can be partitioned into S", C±, C2, . . . , C 7 
such that iV G '(Ci) C S", \Na>(Ci)\ < (3 and tw(G"[C, U iV<y (<?<)]) < for every t < 7. Fix c* 
such that t > (3. This implies that U Nqi(Ci)] is a t-protrusion of G' and hence its size is 

bounded by c. Now we are ready to bound the size of V . 

7 

\V'\ < \S"\ + \d\ < \ S "\ + 7 • P ■ c = 0(r 3 g 3 c 2 k) = 0(k), 
i=i 

for fixed r, g and c. So given an input (G, k), if the size of the reduced graph is more than c • k 
for some c, we return NO else we have G" as the desired kernel. 

The proof for the case when n is quasi-compact is similar to the proof we gave for the case 
when n was compact and Tl a was an annotated p-MAX-CMSO problem in Theorem [TJ This 
concludes the proof. □ 



6 Implications of Our Results 

In this section we mention a few parameterized problems for which we can obtain either poly- 
nomial or linear kernel using either Theorem [J or Theorem [2j Various other problems for which 
we can obtain either polynomial or linear kernels using our results are mentioned in appendix. 

6.1 A Sufficient Condition for Finite Integer Index 

We first give a sufficient condition which implies that a large class of j»-min/max-CMSO prob- 
lems has finite integer index. We prove it here for vertex versions of p-Mw/ MAX-CMS O prob- 
lems that is if n is a p-min/max-CMSO problem then Pjj is a property of vertex sets. The 
edge version can be dealt in a similar manner. 

Let n be a p-MiN-CMSO problem and Tt be the set of pairs [G = (V, E), S) where G is a 
t-boundaried graph and S C V. For a t-boundaried graph G = (V, E) we define the function 
Cg •■ Ft -»■ N U {00} as follows. For a pair (G' = (V',E'), S f ) G T t , if there is no set 5 C V 
(S C E) such that P U (G®G', SuS r ) holds, then ( G ((G', S')) = 00. Otherwise Cg((G", 5')) is the 
size of the smallest SCV (S CE) such that P U {G® G', SU S') holds. If n is a p-MAX-CMSO 
problem then we define £g((G' , S')) to be the size of the largest S C V (S C E) such that 
Pn(G ©G',SU S') holds. If there is no set S C V {S C E) such that P U (G ®G',SU S') holds 
then Cg((G',5')) = 00. 

Definition 9. A p-MiN-CMSO problem n is said to be strongly monotone if there exists a 
function f : N — > N such that the following condition is satisfied. For every t-boundaried graph 
G = (V,E), there is a subset S C V such that for every (G 1 = (V , E'), S') G Tt such that 
Cg((G",5')) is finite, P n (G G' , S U S') holds and \S\ < ( G ((G' , S')) + f(t). 
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Definition 10. A p-MAX-CMSO problem U is said to be strongly monotone if there exists a 
function f : N — ► N such that the following condition is satisfied. For every t-boundaried graph 
G = (V,E), there is a subset S C V such that for every (G' = (V',E'),S') G Tt such that 
Cg((G', S')) is finite, P n (G ®G',SU S') holds and \S\ > Cg((G', S')) - f(t). 

Lemma 12. Every strongly monotone p-MlN-CMSO and p-MAX-CMSO problem has finite 
integer index. 

Proof. We prove for p-MiN-CMSO problems, the proof for p-MAX-CMSO is similar. Let II be 
a monotone p-MiN-CMSO problem. Then Pjj is a finite state property of t-boundaried graphs 
with a distinguished vertex set S |14} 122] . In particular for every t, there exists a finite set S 
of pairs (G, S) such that G = (V, E) is a t-boundaried graph and S C V such that the set S 
satisfies the following properties. For any t-boundaried graph G\ = (Vi,Ei) and set Si C V\ 
there is a pair (G2 = (V2, E2), S2) G S such that for every t-boundaried graph G3 = (Vs,E 3 ) 
and set S 3 C V 3 we have that Pu(Gi © G 3 , Si U S3) Pn(G 2 © G 3 , S 2 U S 3 ). We fix such a 

set 5. 

For a t-boundaried graph G = (V,E) we define the signature Co '■ <-> ~~ ^ N U {00} of G to 
be with domain restricted to S. We now argue that for any t-boundaried graph G, the 
maximum finite value of ^ s a ^ m ost f(t) larger than the minimum finite value taken by 
( G . Since II is strongly monotone there exists a subset S of V that satisfies the conditions of 
Definition M Let (G' = (V, E'), S') G S such that ( G {(G' , S')) is finite. Then P U {G®G', SUS') 
holds and hence £q((G',S')) < \S\. Furthermore the conditions of Definition 1101 imply that 
\S\ ~ fit) < Cg((G',S')) < \S\. Hence the minimum and the maximum finite values of (q can 
differ by at most f(t). By the pigeon hole principle there is a finite set 1Z of t-boundaried graphs 
such that for any t-boundaried graph G there is a Gr G 1Z and a constant cr depending only on 
the size of G and G R , such that for all (G', S') G S we have Cg r {{G', S')) + c R = ( G ((G', S')). 

We now argue that 1Z forms a set of representatives for (II, t). Let G = (V,E) be a t- 
boundaried graph and let Gr = (Vr, Er) S 1Z and cr be a constant such that for all (G', S') G S 
we have (g r ((G', S')) + c R = ( G ((G',S')). Let G' = (V, E') be a t-boundaried graph and 
k be an integer such that (G © G',fc) G II. We argue that (Gr © G',k — cr) G II. Let 
Z C V U V' be a minimum size set such that -Pn(G © G' , Z) is satisfied, Z' = ZPiV and 
Z G = Z \ Z'. Since (G © G", fc) G II we have that \Z\ < k and hence Cg(G', Z n V) is finite. 
Let (Gs,Zs) G 5 be the representative of (G',Z'). Then Pjj(G © Gs,Zg U Z5) holds and 
|-^g| = Cg(G , iS ,Z 5 ). Now we have that CG fl ((G 5 , Z s )) + c R = Cg{{G s ,Z s )). Hence, there 
is a set S^ C Vr of size |Zg?| — cr such that Pjj(Gr © Gs,Sr U Z5) holds. Then, since 
(Gs,Zs) is the representative of (G',Z') we have that -F\i(Gr © G',Sr U Z') holds. Now we 
have that \Sr U Z'| < |S^| + \Z'\ = \Zq\ — cr + |Z'| = |Z| — cr < k — cr. This implies that 
(Gr ®G',k — cr) G n. The proof for the other direction that if (Gr ® G' ,k - c R ) G n then 
(G © G', k) G n is symmetric. This concludes the lemma. □ 

6.2 Covering and Packing Problems 

Minor Covering and Packing: We give below a few generic problems which subsumes many 
problems in itself and fit in our kernelization framework. Let Ti be a finite set of connected 
planar graphs. 
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Vertex- H- Covering 

Input: A graph G = (V, E) E Q g and a non-negative integer k. 
Question: Is there an S C V such that |S*| < fe and G[V \ S] does not contain 
any of the graphs in TC as a minor? 



Vertex-W-Packing 

Input: A graph G £ Q g and a non-negative integer fe. 

Question: Does there exist vertex disjoint subgraphs Gi, . . . , G^ of G such 
that each of them contain some graph in TC as a minor. 



Lemma 13. Let TC be a finite set of connected planar graphs and let Yi\ denote Vertex- 
H-Packing. Then Vertex-H- Covering has finite integer index and is quasi- compact and 
Vertex- TC- Packing has finite integer index and Hi is quasi- compact. 

Proof. Let U v denote Vertex- W-Covering. We first show that ILj is quasi compact. If 
{G = {V, E),k) E ILj then we know that there exists a set S C V such that G[V \ S] does not 
contain any graph in TC as a minor. Now we show that the treewidth of G[V \ S] is at most a 
constant. To show this we need following results. 

• Every planar graph H = (Vh, Eh) is a minor of the r x r grid, where r = 14| Vjy | — 24 |51j . 

• For any fixed graph H, every ilf -minor free graph that does not contain & w x w grid as 
a minor has treewidth 0(w) [26J. 

Let q = max{\H\ \ H E Tt} and w = 14q — 24. Then observe that G[V \ S] does not contain 
w x w grid as a minor and hence tw(G[y \ S]) < O(w). This implies that n„ is quasi-compact. 

Next we show that LTi is quasi-compact. We first introduce some definitions. Given a graph 
G = (V,E), we define the covering number of G with respect to the class TL, cov-^(G), as the 
minimum k such that there exists S C V of size k such that G[V \ S] does not contain any of 
the graphs in TC as a minor. The packing number of G with respect to the class TC, is defined as, 

pack^(G) = max{k \ 3 a partition V\, . . . , Vk oiV such that 

^ie{i....,k}G\Yi] contains a graph in TC as a minor}. 

Less formally, pack^(G) > k if G contains k vertex-disjoint minors in TC. We need the following 
Erdos-Posa type of result shown in [33] for our purpose. 

Claim 1. Let TC be a finite set of connected planar graphs, q = max{|ff | | H E TC} and Q be a 
non-trivial minor-closed graph class. Then there is a constant ag^ q depending only on Q and q 
such that for every graph G E Q, it holds that 

pack w (G) < cover-ft(G) < ag :Q ■ pack w (G). 

Using Claim[T]we show that IT is quasi-compact. Observe that if (G, k) E III and (G, k+1) ^ 
IIi, then by Claim [TJ cover^(G) = 0(k). Hence by an argument, similar to the one used for 
showing that II„ is quasi-compact, we have that LTi is quasi-compact. 

It is known that ILj is a p-MiN-CMSO problem and LTi is a p-MAX-CMSO problem. Now 
using Lemma[12]we show that IL is finite integer index. We show that ILj is strongly monotone. 
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Given a i-boundaried graph G = (V, E), with d(G) as its boundary, let S" C V be a minimum 
set of vertices in G such that G[V \ S"] does not contain any graph in TC as a minor. Take 
S = S"Ud(G). Now for any (G' = (V, E'), S') G T t such that Cg((G', 5")) is finite we have that 
GffiG'[(yuV)\(* s ' u *S")] does not contain any graph in H as a minor and |5| < ( G ((G' , S')) + t. 
This proves that 11^ is strongly monotone. 

Next we show that Hi is finite integer index. Given a i-boundaried graph G = (V,E), we 
define its signature, £g> as follows. Let T'f C be the set of all i-boundaried graph of size at 
most qt and (g '■ ~^ ^. For everv G' G J^ 9 * we define Cg(G') = pack w (G © G"). For any 
G', G" g J?, Kg(G') - Cg(G")| < f + gt, as Cg(0) < Cg(G') < Cg(0) + gt + * for all G' g . 
Hence, by the pigeon hole principle there is a finite set 1Z of i-boundaried graphs such that for 
any i-boundaried graph G there is a Gr G 1Z and a constant cr depending only on the size of 
G and Gr, such that Cg^ + cr = Qg- 

We now argue that 1Z forms a set of representatives for (IIi,t). Let G = (V,E) be a f- 
boundaried graph and let Gr = (Vr, Er) G 1Z and cr be a constant such that (g r + cr = Cg- 
Let G' = (V', E') be a t-boundaried graph and k be an integer such that (G © G', k) G LTi. We 
argue that (Gr © G', fc — cr) G LTi. Let 5 be a set of vertex disjoint minors in G © G' and 

be a minor in S which goes across or touch the boundary d(G). Now contract the part of 
this minor belonging to the side of G' as much close to the boundary d(G) as possible. Since 
the size of largest graph in TC is at most q, we have that the part of the H belonging to the 
side of G' can be contracted to the border except for some q vertices, which could possibly 
be hanging out of the boundary. We do this for every minor in S which is going across. Let 
S' be the set of minors resulting after the contraction operation has been performed. Now 
we take the boundary d(G) and all the minors of S' hanging out of it, and call this resulting 
graph G = (V, E). Observe that we can have at most t minors in S' which can hang out of 
the boundary as these are vertex disjoint minors. Hence \V\ < qt and G is a i-boundaried 
graph. Now we know that (g r + C R = Cg an d hence ^G R (G) + cr = Cg(G). By definition 
Cg r (G) = pack w (G © G) = pack w (G © G'), and hence (Gr ®G',k- c R ) G IIi. The proof for 
the other direction that if (Gr © G',k — cr) G IIi, then (G © G', k) G Hi is symmetric. This 
concludes that Hi is finite integer index. □ 

Vertex-H- Covering contains various problems as a special case, for example: (a) Feed- 
back Vertex Set by taking H = {A} where A is a cycle of length at most 3; deleting at 
most k vertices to obtain a graph of fixed treewidth t; deleting at most k vertices to obtain 
a graph into a graph class M, where M is characterized by a finite set of minors. Similarly 
Vertex- W-Packing contains problem like Cycle Packing as a special case. 

Subgraph Covering and Packing: Let S be a finite set of connected graphs. 
Vertex-5- Covering 

Input: A graph G G Q g and a non-negative integer k. 

Question: Is there a, S C V such that \S\ < k and G[V \ S] does not contain 
any of the graphs in S as a subgraph? 

We similarly define Edge-5-Covering by demanding S C E in the above definition. 
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Vertex-S- Packing 

Input: A graph G G Q g and a non-negative integer k. 

Question: Does there exists k vertex disjoint subgraphs Gi,G2, ■■■■ > G). in G 
such that, for all i G{ is isomorphic to a graph in S. 

We define Edge-5-Packing by demanding that G%, . . . , G\» be edge disjoint subgraphs of G. 

We can not show that Vertex/Edge-5- Covering or Vertex/Edge-S-Packing or there 
no instances are compact unless we do the following simple preprocessing. 
Redundant Vertex and Edge Rule: Given an input (G = (V, E), k) to Vertex/Edge-5-Covering 
or Vertex/Edge-5-Packing remove all edges and vertices that are not part of any subgraph 
isomorphic to any graph in S. 

We can perform the Redundant Vertex and Edge Rule in 0(|V| • |<S|) time by looking at a 
small ball around an edge e or a vertex v and check whether the ball contains a subgraph 
isomorphic to a graph in S and contains the edge e or the vertex v. This algorithm to check a 
subgraph isomorphic to a given graph containing a particular vertex or edge appears in a paper 
by Eppstein [29J. 

Lemma 14. Let S be a finite set of connected graphs and U± and 1I2 correspond to Vertex- 
5-Packing and Edge-S-Packing respectively. Then the following hold: (a) Vertex-S- 
Covering has finite integer index and is compact; (b) Vertex-5-Packing has finite integer 
index and IT is compact; (c) Edge-5-Covering is p-MiN-CMSO problem and is compact; 
and (d) Edge-5- Packing is p-MAX-CMSO problem andU 2 is compact. 

Proof. Let IL and II e denote Vertex-S- Covering and Edge-S-Covering respectively. Let 
s = max{\V*\ I G* = (V*,E*) G S}. Without loss of generality we assume that an input (G, k) 
to all these problems are reduced with respect to Redundant Vertex and Edge Rule. 

We first show that these problems or their no instances are compact. Let {G, k) G IL, then 
we know that there is a S C V such that j^l < k and G[V \ S] does not contain any of the graphs 
in S as a subgraph and every vertex and edge is in some subgraph in G which is isomorphic to 
a subgraph in S. This implies that every vertex in u G V \ S is in at most r = O(s) distance 
away from a vertex in S and hence 'Bq(S) = V. We can similarly show that LT e is compact. 
Next we show that IIi is compact. Observe that if (G, k) G IT and (G, k + 1) ^ III then we 
know that we have a set Z of k vertex disjoint subgraphs in G where each of them is isomorphic 
to a subgraph in S. Take S as the union of all the vertices appearing in any of the subgraph 
in Z. Note that \S\ < s ■ k. Observe that S hits all the subgraphs isomorphic to a subgraph in 
S and hence Hq(S) = V, where r = O(s). This implies that LTi is compact. We can similarly 
show that LT2 is compact. 

It is well known that IL and II e are p-MlN-CMSO problems while IT and LT2 are p-MAX- 
CMSO problems. The proof that ILj and LTi are finite integer index is similar to the proof 
given for Vertex- H- Covering and Vertex-'H-Packing, where H is a finite set of connected 
planar graphs, have finite integer index in Lemma [13l The only thing we need to replace is 
minors by subgraphs in that proof. □ 

6.3 Domination and its Variants 

In the r-DOMiNATiNG Set problem, we are given a graph G = (V,E), and a positive integer 
k, and the objective is to find a subset S C V such that B r G (S) = V and \S\ < k. For 
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r = 1, if we demand that G[S] is connected then we get Connected Dominating Set. A 
problem is called ^-Threshold Dominating Set if we demand that Bq(S) = V and for all 
v £ (V \ S), \N(v) n S\ > q. An independent set C of vertices in a graph G = (V, E) is an 
efficient dominating set (or perfect code) when each vertex not in C is adjacent to exactly one 
vertex in C. In Efficient Dominating Set problem we are given a graph G = (V, E) and a 
positive integer k and the objective is to find an efficient dominating set of size at most k. 

Lemma 15. /--Dominating Set, Connected Dominating Set, (/-Threshold Dominat- 
ing Set and Efficient Dominating Set are compact and have finite integer index. 

Proof. All these problems are compact by definition. To show that these problems have finite 
integer index, we make use of Lemma [121 Clearly, they are p-MlN-CMSO problems. We now 
show that these problems are strongly monotone. We first show it for r-DOMiNATiNG Set. 
Given a t-boundaried graph G = (V,E), with d(G) as its boundary, let S" C V be a minimum 
r-dominating set of G. Take S = S" U d{G). Now for any (G' = (V',E'),S') £ T t such 
that Cg((G' , S')) is finite we have that S U S' is a r-dominating set and \S\ < Cg((G' , S')) + t. 
This proves that t-Dominating Set is strongly monotone. Similarly, we can show that q- 
Threshold Dominating Set is strongly monotone by taking S = S" U d(G) where S" C V 
is a minimum q-threshold dominating set of G. To show that Connected Dominating Set 
is strongly monotone we take S = S" U d{G) where S" C V is a union of minimum connected 
dominating set for each connected components of G. 

We now prove that Efficient Dominating Set has finite integer index. Let II be the 
Dominating Set problem and II' be the Efficient Dominating Set problem. The property 
P(G) that G has an efficient dominating set (of any size) is expressible in CMSO and hence this 
property is finite state in f-boundaried graphs. As argued in the above paragraph, Dominating 
Set has finite integer index. Furthermore by a theorem of Bange et al. [7], if a graph G has an 
efficient dominating set, then the size of the minimum efficient dominating set is equal to the 
size of the minimum dominating set of the graph. Hence for two t-boundaried graphs G\ , G2 
if G\ and G2 are in the same equivalence class of P and G\ =n G2 then G\ =n' Gi- Hence n' 
has a finite set of representatives. □ 

6.4 Problems on Directed Graphs 

Our results also apply to problems on directed graphs of bounded genus. In this direction we 
mention three problems considered in the literature. In Directed Domination [2] we are 
given a directed graph D = (V, A) and a positive integer k and the objective is to find a subset 
S C V of size at most k such that for very vertex u £ V \ S there is a vertex v £ S such 
that (u, v) G A. Independent Directed Dominatio^ [H] takes as input a directed graph 
D = (V, A) and a positive integer k and the objective is to find a subset S C V of size at most 
k such that S is an independent set and for every vertex u £ V \S there is a vertex v £ S such 
that (u, v) £ A. In the Minimum Leaf Out-branching [10] problem we are given a directed 
graph D = (V, A) and a positive integer k and the objective is to find a rooted directed spanning 
tree (with all arcs directed outwards from the vertices) with at least k internal vertices. 

Lemma 16. Independent Directed Domination is a p-min-CMSO compact problem, Di- 
rected Domination is compact and has finite integer index and n=MiNiMUM Leaf Out- 
branching is a p-MAX-CMSO problem and U is compact. 

1 In literature it is known as "Kernels". We call it differently here to avoid confusion with problem kernels. 
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Proof. Independent Directed Domination and Directed Domination can easily be seen 
to bep-MlN-CMSO problems and by their definition they are compact. Directed Domination 
can be shown to have finite integer index as follows: given a i-boundaried graph G = (V,E), 
with d(G) as its boundary, let S" C V be a minimum directed dominating set of G. Take 
S = S" U d(G). Now for any (G' = (V',E'),S') G T t such that ( G ((G',S')) is finite we have 
that S U S' is a directed dominating set and \S\ < Cg((G', S')) + 1. This proves that Directed 
Dominating Set is strongly monotone. 

Let II be the Minimum Leaf Out-branching problem. Observe that II is a p-MAX- 
CMSO problem. The set of no instances can be seen to be compact by observing the fact that 
if (G, k) £ LI and (G, k + 1) ^ LI then G has an out-branching with exactly k internal vertices 
with all other vertices being its leaves. This implies that IT is compact. □ 

Finally by applying Theorem [2] together with Lemmata [T2"l [TBT El fT5| and [16] we get the 
following corollary. 

Corollary 2. For g > 0, Feedback Vertex Set, Edge Dominating Set, Vertex Cover, 
Dominating Set, /--Dominating Set, ^-Threshold Dominating Set, Connected Dom- 
inating Set, Directed Domination, /--Scattered Set, Connected Vertex Cover, 
Minimum- Vertex Feedback Edge Set, Minimum Maximal Matching, Efficient Dom- 
inating Set, Independent Set, Induced g?-Degree Subgraph, Min Leaf Spanning 
Tree, Induced Matching, Triangle Packing, Cycle Packing, Maximum Full-Degree 
Spanning Tree, Cycle Domination, Vertex- W-Packing, Vertex- W-Covering, Vertex- 
5-Covering and Vertex-5- Packing admit a linear kernel on graph of genus at most g. 

Corollary [2] unifies and generalizes results presented in [21 I3T [TT] H2| [TBI IM l38l l39| 1321 146| 147] . 
By applying Theorem [lj Corollary [lj and Lemmata [T3l [TIT [ToT and [TB] we get the following 
corollary for problems which are not finite integer index. 

Corollary 3. For g > 0, Independent Dominating Set, Independent Directed Dom- 
ination, Minimum Leaf Out-branching, Odd Set, Edge-5-Covering and Edge-5- 
Packing admit a polynomial kernels on graphs of genus at most g. 

7 Open Problems and Further Directions 

This paper gives for the first time meta theorems, as called by Grohe and Kreutzer [36[ 144] , 
where logical and combinatorial conditions on problems lead to kernels of polynomial or linear 
sizes. Our results are very general in the sense that they can be applied to a large number of 
combinatorial problems on graphs on fixed surfaces and generalize a large collection of published 
results. Still, there are several directions in which our results could possibly be extended. We 
conclude the paper in this section with some new problems and further research directions 
opened by our results. 

Further extensions: The first natural question for further research is if our logical and 
combinatorial conditions can be extended to larger classes of problems. The condition that 
problems should satisfy some kind of compactness or quasi-compactness cannot be omitted. 
For instance, even though the problem of finding a path of length k is expressible in first order 
logic, it does not admit a polynomial kernel on planar graphs, unless polynomial hierarchy 
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collapses to the third level, a collapse which is deemed unlikely [ID] . Two interesting questions 
for further research are: 

• Can we replace the compactness condition with the weaker notion of quasi- compactness 
in Theorem QJ 

• Do all quasi-compact CMSO problems admit a linear kernel on graphs of bounded genus? 

It is very natural to ask whether our results can be extended to more general classes of graphs. 
The most natural candidates for such extensions are graphs of bounded local-treewidth and 
graphs excluding some fixed graph as a minor. 

Practical considerations: Our meta theorems provide simple criteria to decide whether 
a problem admits a polynomial or linear kernel. Our proofs are constructive and essentially 
provide meta- algorithms that construct kernels for problems in an automated way. Of course, 
it is expected that for concrete problems, tailor-made kernels will have much smaller constant 
factors, than what would follow from a direct application of our results. However, our approach 
might be useful for computer aided design of kernelization algorithms: with a Myhill-Nerode 
approach, a computer program can output a set of rules that transform each region to a minimum 
size representative and estimate the obtained kernel size. This seems an interesting and far from 
trivial algorithm-engineering problem. In general, finding linear kernels with small constant 
factors for concrete problems on planar graphs or graphs with small genus remains a worthy 
topic of further research. 

Some concrete open problems: We conclude with some concrete problems that cannot be 
resolved by our approach. These include Directed Feedback Vertex Set [18], and Odd 
Cycle Transversal [50] to name a few. All these problems are expressible in CMSO but 
none of them are known to be quasi-compact. For each of these problems we do not know if it 
has a polynomial kernel even on planar graphs, and we leave them open. 
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A Problem Compendium 

We refer to problem compendium given in [28J or the compendium of parameterized prob- 
lems provided at http://bravo.ce.uniroma2.it/home/cesati/research/compendium/ for 
the definitions of problems given below. 

A.l Problems that have Finite Integer Index and Quasi-Compactness Prop- 
erty — Linear Kernels 

Dominating Set, t-Dominating Set, ^-Threshold Dominating Set, Efficient Domi- 
nating Set, Vertex Cover, Connected t-Dominating Set, Connected Vertex Cover, 
Minimum- Vertex Feedback Edge Set, Vertex-W-Covering, Minimum Maximal Match- 
ing, Connected Dominating Set, Vertex-5-Covering, Clique-Transversal, Almost- 
Outerplanar, Feedback Vertex Set, Cycle Domination, Edge Dominating Set. 

A. 2 Problems that have Finite Integer Index and No Instances have Quasi- 
Compactness Property — Linear Kernels 

Independent Set, Induced c?-Degree Subgraph, /--Scattered Set, Min Leaf Spanning 
Tree, Induced Matching, Triangle Packing, Cycle Packing, Maximum Full-Degree 
Spanning Tree, Vertex-W-Packing, Vertex-5-Packing. 

A. 3 Problems that are not covered by IA.1I and IA.21 and are ^-min/eq/max- 
CMSO problems having Compactness Property — Polynomial Kernels 

Independent Dominating Set, Independent Directed Domination, Minimum Leaf 
Out-branching, Edge-5-Covering, Edge-5-Packing, Odd Set. 

A. 4 Problems that have Finite Integer Index but neither the problem nor 
its no instances satisfy Quasi-Compactness Property 

Minimum Partition Into Cliques, Hamiltonian Path Completion. 
A. 5 Problems that do not have Finite Integer Index 

Longest Path, Longest Cycle, Maximum Cut, Minimum Covering by Cliques, Inde- 
pendent Dominating Set, Minimum Leaf Out-branching, Odd Set. 

The following lemma is shown in |24j . 

Lemma 17. [24] Longest Path, Longest Cycle, Maximum Cut and Minimum Covering 
by Cliques are not of finite integer index. 
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